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Abstract. A Boolean constraint satisfaction instance is a conjunction of 
constraint applications, where the allowed constraints are drawn from a 
fixed set C of Boolean functions. We consider the problem of determining 
whether two given constraint satisfaction instances are equivalent and 
prove a Dichotomy Theorem by showing that for all sets C of allowed 
constraints, this problem is either polynomial-time solvable or coNP- 
complete, and we give a simple criterion to determine which case holds. 
A more general problem addressed in this paper is the isomorphism prob- 
lem, the problem of determining whether there exists a renaming of the 
variables that makes two given constraint satisfaction instances equiva- 
lent in the above sense. We prove that this problem is coNP-hard if the 
corresponding equivalence problem is coNP-hard, and polynomial-time 
many-one reducible to the graph isomorphism problem in all other cases. 

Keywords: computational complexity, propositional logic, constraint 
satisfaction problems, logic in computer science 
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1 Introduction 

In 1978, Thomas J. Schaefer proved a remarkable result. He examined the sat- 
isfiability of propositional formulas for certain syntactically restricted formula 
classes. Each such class is given by the finite set C of Boolean functions allowed 
when constructing formulas. A C-formula in his sense now is a conjunction of 
clauses, where each clause consists of a Boolean function from C applied to some 
propositional variables. Such a Boolean function can be interpreted as a con- 
straint that has to be fulfilled by a given assignment; the satisfiability problem 
for C-formulas hence provides a mathematical model for the examination of the 
complexity of constraint satisfaction problems, studied in artificial intelligence 
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and database theory. Let CSP(C) denote the problem of deciding whether a 
given C-formula is satisfiable. Schaefer |Sch78| showed that, depending on C, the 
problem CSP(C) is cither (1) efficiently (i.e., in polynomial time) solvable or (2) 
NP-complete (see also [GJ79, Problem L06]); and he gave a simple criterion that 
allows one to determine whether (1) or (2) holds. Since the complexity of CSP(C) 
is either easy or hard (and not located in one of the — under the assumption P =/= 
NP — infinitely many intermediate degrees between P and the NP-complete sets 



[Lad75]), Schaefer called this a "dichotomy theorem for satisfiability." 



In recent years, there has been renewed interest in Schaefer's result and con- 
straint satisfaction problems. 



N. Creignou examined in [Cre95| how difficult it is to find assignments to con- 
straint satisfaction problems that do not necessarily satisfy all clauses but as 
much as possible. Together with Hermann she studied the difficulty of determin- 
ing the number of satisfying assignments of a given constraint satisfaction prob- 



lem in [CH96|. In [CH97| Creignou and Herbrard discussed algorithms that gen- 
erate all satisfying assignments, turning their attention to the question whether 
such an algorithm, given the fact that it has already found a satisfying assign- 
ment, can find another one in polynomial time. Kirousis and Kolaitis researched 
the complexity of fi nding minimal satisfying assignments for constraint satis- 
faction problems in KK01 | and Khanna , Sudan a nd Trevisan examined the ap- 
proximability of these problems [ KST97 , KSW97 1 . Reith and Vollmer had a look 
at lexicographical minim al o r maxim al satisfying assignments constraint satis- 
faction problems flRV00| . In |RW0(fl Reith and Wa gner examined very closely 
various problems in the vicinity of constraint satisfaction such as the circuit 
value problem, counting and threshol d probl ems for restricted classes of Boolean 
circuits. The PhD thesis of S. Reith | Rci01 contains a wealth of results about 
problems dealing with restricted Boolean circuits, formulas, and constraint sat- 
isfaction. 



As mentioned above, constraint satisfaction problems are used as a program- 
ming or query language in fields such as artificial intelligence and database the- 
ory, and the above complexity results shed light on the difficulty of design of 
systems in that areas. A problem of immense importance from a practical per- 
spective is that of determining whether two sets of constraints express the same 
state of affairs (that is, arc equivalent), for example, in the applications, if two 
programs or queries are equivalent, or if a program matches a given specification. 
Surprisingly, this problem has not yet been looked at from a complexity point 
of view. In the case of unrestricted prepositional formulas, the equivalence prob- 
lem is easily seen to be complete for coNP. The main result of the present paper 
(Theorem ||) is a complete classification of the complexity of determining if two 
given constraint satisfaction instances are equivalent. We consider constraints 
drawn from a fixed arbitrary finite set C of Boolean functions and show that for 
all such C, the considered problem is either (1) solvable in polynomial time, or 
(2) complete for coNP. As in Schaefer's result, our proof is constructive in the 
sense that it allows us to easily determine, given C, if (1) or (2) holds. 



Besides the immediate practical relevance of the equivalence problem, we 
also see our results as contributions to the study of two other decision problems: 
First, the equivalence problem is a "sub-problem" of the minimization problem, 
i.e., the problem to find out, given a set of constraints, if it can equivalently be 
expressed with a fewer number of constraints. Secondly, equivalence relates to 
the isomorphism problem, which has been studied from a theoretical perspec- 
tive for various mathematical structures. Most prominently, the question if two 
given (directed or undirected) graphs are isomorphic is one of the few pro blems 
in NP neither known to be in P nor known to be NP-complete KST93 |. The 
most recent news about graph isomor phism are a number of hardness results 
(e.g., for NL, PL, an d DET) given in [ PorOC | . Related to our study are the pa- 
pers |ATO0| , |BRS98| presenting a number of results concerning isomorphism of 
propositional formulas. In Section |J, we show (Theorem [l8]) that the isomor- 
phism problem for constraint applications is coNP-hard if the corresponding 
equivalence problem is coNP-hard, and polynomial-time many-one reducible to 
the just mentioned graph isomorphism problem in all other cases. We also show 
that for a number of these cases, the isomorphism problem is in fact polynomial- 
time many-one equivalent to graph isomorphism (Theorems |2^ and |2^). The 
same proof technique can be used to prove a general, non-trivial (parallel 
access to NP) upper bound for the isomorphism problems for constraint satis- 
faction (Theorem S3) . 



2 Boolean Constraint Satisfaction Problems 

We start by formally introducing constraint satisfaction problems. The defini- 
tions necessary for the equivalence and isomorphism problems will be given in 
the upcoming sections. 

Definition 1. 1. A constraint C (of arity k) is a Boolean function from {0, l} k 
to {0,1}. 

2. If C is a constraint of arity fc, and xx, %2, ■ ■ ■ , %k are (not necessarily distinct) 
variables, then C(xi, X2, • ■ ■ , Xk) is a constraint application of C . 

3. If C is a constraint of arity k, and for 1 < i < k, Xi is a variable or a 
constant (0 or 1), then C(xi, X2, ■ ■ ■ , x^) is a constraint application of C 
with constants. 

The decision problems examined by Schacfcr arc the following. 
Definition 2. Let C be a finite set of constraints. 

1. CSP(C) is the problem of, given a set S of constraint applications of C, to 
decide whether S is satisfiablc, i.e., whether there exists an assignment to 
the variables of S that satisfies every constraint application in S. 

2. CSP C (C) is the problem of, given a set S of constraint applications of C with 
constants, to decide whether S is satisfiable. 



Clearly, there are an infinite number of CSP(C) problems. In 1978, Schaefer 
proved the surprising result that constraint satisfiability problems are either in P 
or NP-complete. He also completely characterized for which sets of constraints 
the problem is in P and for which it is NP-complete. Consult the excellent 
monograph [CKSOO for an almost completely up-to-date overview of further 
results and dichotomy theorems for constraint satisfaction problems. 

The question of whether satisfiability for CSPs is in P or NP-complete de- 
pends on those properties of the involved Boolean functions that we define next. 



Definition 3. Let C be a constraint. 



— C is 0-valid if C(0) = 1. 

— C is 1-validif C(l) = 1. 

— C is Horn (a.k.a. weakly negative) if C is equivalent to a CNF formula where 
each clause has at most one positive variable. 

— C is anti-Horn (a.k.a. weakly positive) if C is equivalent to a CNF formula 
where each clause has at most one negative variable. 

— C is bijunctive if C is equivalent to a 2CNF formula. 

— C is affine if C is equivalent to an XOR-CNF formula. 

— C is complementive (a.k.a. C-closed) if for every s € {0, 1} , C(s) = C(s), 
where k is the arity of C and s = dof (1 — si)(l — S2) ••■(! — s/.) for s = 

SlS 2 • ■ • Sfe. 

Let C be a finite set of constraints. We say C is 0-valid, 1-valid, Horn, anti-Horn, 
bijunctive, affine, or complementive if every constraint C G C is 0-valid, 1-valid, 
Horn, anti-Horn, bijunctive, affine, or complementive, respectively. Finally, we 
say that C is Schaefer if C is Horn or anti-Horn or affine or bijunctive. 

Schaefer's theorem can now be stated as follows. 



Theorem 4 (Schaefer |Sch78|| ). Let C be a finite set of constraints. 



1. If C is 0-valid, 1-valid, or Schaefer, then CSP(C) is in P; otherwise, CSP(C) 
is NP-complete. 

2. If C is Schaefer, then CSP C (C) is in P; otherwise, CSP C (C) is NP-complete. 

In this paper, we will study two other decision problems for constraint satis- 
faction problems. In the next section, we will look at the question of whether two 
given CSPs are equivalent. In Section ^, we address the isomorphism problem 
for CSPs. In both cases, we will prove dichotomy theorems. 



3 The Equivalence Problem for Constraint Satisfaction 

The decision problems studied in this section are the following: 
Definition 5. Let C be a finite set of constraints. 



1. EQUIV(C) is the problem of, given two sets S and U of constraint applica- 
tions of C, to decide whether S and U are equivalent, i.e., whether for every 
assignment to the variables, S is satisfied if and only if U is satisfied. 

2. EQUIV C (C) is the problem of, given two sets S and U of constraint applica- 
tions of C with constants, to decide whether S and U are equivalent. 

It is immediate that all equivalence problems are in coNP. Note that, in some 
sense, equivalence is at least as hard as non-satisfiability, since S is not satisfiable 
if and only if S is equivalent to 0. Thus, we obtain immediately that if C is not 
Schaefer, then EQUIV C (C) is coNP-complete. 

On the other hand, equivalence can be harder than satisfiability. For example, 
equivalence between Boolean formulas with A and V (i.e., without negation) is 
coNP-complete EG95[ while non-satisfiability for these formulas is clearly in P. 



We will prove following dichotomy theorem. 

Theorem 6. LetC be a set of constraints. IfC is Schaefer, then EQUIV(C) and 
EQUIV C (C) are mP; otherwise, EQUIV(C) and EQUIV C (C) are coNP- complete. 

The cases of constraints with polynomial-time equivalence problems are easy 
to identify, using the following theorem: 

Theorem 7. EQUIV C (C) is truth-table reducible to CSP C (C). 

Proof. Let S and U be two sets of constraint applications of C with constants. 
Note that S and U are equivalent if and only if U — > A for every constraint 
application A £ S, and S — ► B for every constraint application B G U (see, e.g., 



[EK92Q. Here and in the following, when we write a set of constraint applications 
S in a Boolean formula, we take this to be a shorthand for f\2 eS A. 

Given a constraint application A with constants and a set S of constraint 
applications of C with constants, it is easy to check whether S — > A with at most 
2 fc truth-table queries to CSP C (C), where k is the maximum arity of C: For every 
assignment to the variables in A that does not satisfy A, substitute this partial 
truth assignment in S. S — > A if and only if none of these substitutions results 
in a satisfiable set of constraint applications. □ 

If C is Schaefer, then CSP C (C) is in P by Schacfcr's theorem and we immedi- 
ately obtain the following corollary. 

Corollary 8. IfC is Schaefer, then EQUIV C (C) is in P. 

Having identified the easy equivalence cases, the following theorem proves 
the second half of our Dichotomy Theorem ||: 

Theorem 9. IfC is not Schaefer, then EQUIV(C) is coNP -hard. 

First of all, note that this would be easy to prove if we had constants in the 
language, since for all sets S of constraint applications of C the following holds: 
S is not satisfiable if and only if S is equivalent to the constraint 0. Still, we can 
use this simple observation in the case where C is not 0-valid and not 1-valid. 



Claim 10 If C is not Schaefer, not 0-valid, and not 1-valid, then EQUIV(C) is 
coNP -hard. 



Proof. We will reduce CSP(C) to EQUIV(C). Let 5 be a set of constraint appli- 
cations of C. As noted above, 5 is not satisfiable if and only if 5 is equivalent to 
0. Let Co £ C be a constraint that is not 0-valid, and let C\ £ C be a constraint 
that is not 1-valid. Note that, for any variable x, {Cq(x, . . . , x), C±(x, . . . , x)} 
is equivalent to 0, and thus 5 £ CSP(C) if and only 5 is equivalent to 
{C (y,...,y),Cx(y,...,y)}. □ 

If C is not Schaefer, but is 0-valid or 1-valid, then every set of constraint 
applications of C is trivially satisfiable (by or 1). In these reduction 
from CSP(C) will not help, since CSP(C) is in P. However, we will show that 
in these cases the problem of determining whether there exists a non-trivial 
satisfying assignment is NP-complete and we will use the complements of these 
satisfiability problems to reduce from. 

Creignou and Hebrard prove the following result, concerning the existence 



of non-trivial satisfying assignments (|CH97, Proposition 4.7], their notation for 
our CSP #0 ,i is SAT*): 



Proposition 11 ([ |CH97| ). If C is not Schaefer, then CSP^ ,i(C) is NP- 



complete, where CSP^o.i(C) is the problem of, given a set S of constraint appli- 
cations of C, to decide whether there is a satisfying assignment for 5 other than 
and 1. 

CSP^o.i(C) corresponds to the notion of "having a non-trivial satisfying 
assignment" in the case that C is 0-valid and 1-valid. We will reduce CSP^o,i(C) 
to EQUIV(C) in this case in the proof of Claim [l5] to follow. 

For the cases that C is not 1-valid or not 0-valid, we obtain the following 
analogues of Proposition A proof can be found in the appendix. 

Theorem 12. 1. If C is not Schaefer and not 0-valid then CSP^i(C) is NP- 
complete, where CSP^i(C) is the problem of, given a set S of constraint 
applications of C, to decide whether there is a satisfying assignment for S 
other than 1. 

2. If C is not Schaefer and not 1-valid then CSP^o(C) is NP-complete, where 
CSP^o(C) is the problem of, given a set S of constraint applications of C, to 
decide whether there is a satisfying assignment for S other than 0. 

Proof. Careful inspection of Creignou and Hcbrard's proof of Proposition [□] 
shows that following holds if C is not Schaefer: 

1. If C is not 0-valid and not 1-valid, then C = {S \ S £ CSP^ ,i(C) and not 
5(0) and not 5(1)} is NP-complete (this is case 1 of Creignou and Hebrard's 
proof). 

2. If C is 0-valid and not 1-valid, then £ = {S \ 5 £ CSP #0 ,i(C) and not 5(1)} 
is NP-complete (this is case 2b of Creignou and Hebrard's proof). 

3. If C is 1-valid and not 0-valid, then & = {5 | 5 £ CSP #0 ,i(C) and not 5(0)} 
is NP-complete (this is case 3b of Creignou and Hebrard's proof). 



This almost immediately implies Theorem [12]. Let C be not Schaefer and not 
0- valid. If C is not 1-valid, then C trivially many-one reduces to CSP^i(C), since, 
for 5 a set of constraint applications of C, 5 £ C if and only if not 5(0), not 
5(1), and 5 £ CSP^i(C). Similarly, if C is 1-valid, then C\ trivially many-one 
reduces to CSP^i(C). This proves part (1) of Theorem |l2|. Part (2) follows by 
symmetry. □ 

Claim 13 Let C be a finite set of constraints. 

1. If C is 1-valid, not Schaefer, and not 0-valid, then EQUIV(C) is coNP -hard. 

2. If C is 0-valid, not Schaefer and not 1-valid, then EQUIV(C) is coNP -hard. 

Proof. We will prove the first case; the proof of the second case is similar. We 
will reduce CSP^i(C) to EQUIV(C) as follows. Let 5 be a set of constraint 
applications of C and let x\, . . . , x n be the variables occurring in 5. Note that 1 
satisfies 5, since every constraint in 5 is 1-valid. Therefore, 5 ^ CSP^i(C) if and 
only if 5 is equivalent to AILi x i- Let C £ C be not 0-valid. Since C is 1-valid, 
Xj is equivalent to C(xi, . . . ,Xi). It follows that 5 g" CSP^i(C) if and only if 5 
is equivalent to {C(x i7 . . . , x^ | 1 < i < n}. □ 
The final case is where C is both 0-valid and 1-valid. We need the following key 
lemma from Creignou and Hebrard which is used in their proof of Proposition |Tl] . 



Lemma 14 ( [|CH97|1 , Lemma 4.9(1)). Let C be a set of constraints that is 



not Horn, not anti-Horn, not affine, and 0-valid. Then either 

1. There exists a set Vq of constraint applications of C with variables x and y 
and constant such that Vq is equivalent to x — > y, or 

2. There exists a set Vq of constraint applications of C with variables x, y, z and 
constant such that Vq is equivalent to (x A y A z) V (x A y A z) V (x A y A z) . 



Claim 15 Let C be a finite set of constraints. If C is not Schaefer but both 
0-valid and 1-valid, then EQUIV(C) is coNP -hard. 

Proof. We will reduce CSP^ ,i(C) to EQUIV(C). Let 5 be a set of constraint 
applications of C and let x\,...x n be the variables occurring in 5. Note that 
and 1 satisfy 5, since every constraint in 5 is 0-valid and 1-valid. Therefore, 
5 CSP^o,i(C) if and only if 5 is equivalent to A"=i x i v A"=i ~i- 

First, suppose there is a constraint C £ C that is non-complementive. (This 
case is similar to Creignou and Hebrard's case 2a). Let k be the arity of C and 
let s £ {0, l} fc be an assignment such that C(s) = 1 and C(s) = 0. Let A(x,y) 
be the constraint application C{a\, . . . , at), where at = y if Si = 1 and at = x if 
s t = 0. Then A(0,0) = A(l, 1) = 1, since A is 0-valid and 1-valid; A(0, 1) = 1, 
since C(s) ~ 1; and A(1,0) = 0, since C(s) — 0. Thus, A(x,y) is equivalent to 
x — > y. Since A"=i x i V A"=i^" ^ s equivalent to Ai<i j< n ( x i ~~ * x j)i ^ follows 
that 5 ^ CSP^o,i(C) if and only if 5 is equivalent to /\ 1<i j <n A(xi,xj). 

It remains to consider the case where every constraint in C is complementivc. 
Let Vq be the set of constraint applications of C with constant from Lemma H4. 



Let Vf be the set of constraint applications of C that results when we replace each 
occurrence of in Vo by /, where / is a new variable. Note that the following 
holds. There are two cases to consider, depending on the form of Vo- 

Case 1: Vo(x, y) is equivalent to (x — > y). In this case, consider V/(/, x, y). Since 
Vf (0, x, y) is equivalent to x — > y, and every constraint in S is complementive, 
it follows that Vf(f, x, y) is equivalent to (/A (x — > y)) V (f A(y — > x)). Thus, 
AlLi V A"=i ^ S equivalent to Ai<i j< n Vf (/> x ij ^jOj an d ^ follows that 
5 ^ CSP^o,i(C) if and only if S is equivalent to Ai<j j< n Vf (/> ^ ^j)- 

Case 2: Vo(x, y, z) is equivalent to (x A y A z) V (x A y A z) V (x A y A z). 
Since all constraints in Vq arc complementive, Vf(f,x,y,z) behaves as 
follows: V> (0,0, 0,0) = 1^/(0,1,0,1) = V f (0, 0,1,1) = F/(l, 1,1,1) = 
Vf(l, 0, 1, 0) = Vf(l, 1, 0, 0) = 1, and Vf is for all other assignments. Note 
that Vf(f, /, Xj, Xj) is equivalent to (x, <-> Xj), and thus that A»=i x i^Ai=i %i 
is equivalent to /\i<ij< n 

V f (f,f,Xi, Xj ). It follows that S CSP /0 ,i(C) if 
and only if S is equivalent to Ai<j j< n (/) /j ^ ^i)- '-' 

4 The Isomorphism Problem for Constraint Satisfaction 

In this section, we study a more general problem: The question of whether a set 
of constraint applications can be made equivalent to a second set of constraint 
applications using a suitable renaming of its variables. We need some definitions. 

Definition 16. 1. Let X = {x\, . . . , x n } be a set of variables. By ir: {x\, . . . , 
x n} —* {%i, ■ ■ ■ i %n} we denote a permutation of X. 

2. Let S be a set of constraint applications over variables X and tt a permuta- 
tion of X. By ir(S) we denote the set of constraint applications that results 
when we replace simultaneously all variables X{ of S by 7r(xj). 

3. Let S be a set of constraint applications over variables X. 
The number of satisfying assignments of S is #i(S) = doi 

| { / | / is an assignment to all variables in X that satisfies every constraint 
application in S}\\. 

The isomorphism problem now is formally defined as follows: 

Definition 17. 1. ISO(C) is the problem of, given two sets S and U of con- 
straint applications of C over variables X, to decide whether S and U are 
isomorphic, i.e., there exists a permutation n of X such that n(S) is equiv- 
alent to U. 

2. ISOc(C) is the problem of, given two sets S and U of constraint applica- 
tions of C with constants over variables X, to decide whether S and U are 
isomorphic. 

We remark that for S and U to be isomorphic, we require that formally they 
are defined over the same set of variables. Of course, this does not mean that all 
these variables actually have to occur textually in both formulas. 



As in the case for equivalence, isomorphism is in some sense as least as hard 
as non-satisfiability, since S is not satishable if and only if S is isomorphic to 0. 
Thus, we immediately obtain that if C is not Schaefer, then ISO c (C) is coNP- 
hard. Unlike the equivalence case however, we do not have a trivial coNP upper 



bound for isomorphism problems. In fact, there is some evidence [ ATOC] that the 
isomorphism problem for Boolean formulas is not in coNP. Note that determin- 
ing whether two formulas or two sets of constraint applications are isomorphic 
is trivially in However, the isomorphism problem for formulas is not in- 



complete unless the polynomial hierarchy collapses ATOC ] . In the sequel (Theo- 



rem 24|) we will prove a stronger result for the isomorphism problem for Boolean 
constraints: We will prove a Pj^ P upper bound for these problems, where Pj| p 
is the class of problems that can be solved via parallel access to NP. This class 



has many different characterizations, see, for example, Hemaspaandra [HcmS 
Papadimitriou and Zachos [ PZ83| , Wagner HWag9CH . 



For equivalence, we obtained a polynomial-time upper bound for sets of con- 
straints that are Schaefer. In contrast, as we will show in the sequel, it is easy 
to see that, for example, isomorphism for positive 2CNF formulas (i.e., iso- 
morphism between two sets of constraint applications of {(0, 1), (1, 0), (1, 1)}) is 
polynomial-time many-one equivalent to the graph isomorphism problem (GI). 

The main result of this section is the following theorem. 

Theorem 18. Let C be a set of constraints. If C is Schaefer, then ISO(C) and 
ISO c (C) are polynomial-time many-one reducible to GI, otherwise, ISO(C) and 
ISO c (C) are coNP-hard. 

Note that if C is Schaefer, then the isomorphism problem ISO(C) cannot 
be coNP-hard, unless the polynomial-time hierarchy collapses. (This follows 
since, by our theorem, if ISO(C) is coNP-hard then GI is coNP-hard and, since 
GI e NP, coNP would be a subset of NP and thus NP=coNP which implies the 
mentioned collapse.) Under the assumption that the polynomial-time hierarchy 
does not collapse, Theorem [l8] thus distinguishes an easy case (reducible to GI) 
and a hard case. In this sense, Theorem |l8| is again a dichotomy theorem. 

We will first have a look at the lower bound part of Theorem [H| For that 
we need the following property: 

Lemma 19. Let S and U be sets of constraint applications of C with constants. 
If S is isomorphic to U then ffi(U) = =ft=i(S). 

Proof. First note that every permutation of the variables of S induces a permu- 
tation of the rows of the truth-table of S. Now let it be a permutation such that 
ir(S) = U. Then # x (5) = #i(tt(S)) and #i(tt(5)) = #i(U). □ 

Theorem 20. If C is not Schaefer, then ISO(C) is coNP-hard. 



Proof. We first note that a claim analogous to Claim 10 also holds for 
isomorphism, i.e., if C is not Schaefer, not 0- valid, and not 1-valid, then 
ISO(C) is coNP-hard. For the proof, we use the same reduction as in 



the proof of Claim and claim that, again, S £ CSP(C) if and only 
if S is isomorphic to {Co{y, ...,y), C±(y, y)}. For the direction from 
left to right note that if (S, {C (y, . . . , y), d(y, . . . , y)}) £ EQUIV(C) then 
also (S,{Co{y, ■ ■ ■ ,y),Ci(y, . . . ,y)}) £ ISO(C) by the identity permutation 
7T = id. For the other direction note that S CSP(C) iff #i(S) > 0. 
Now suppose (S,{Co(y, . . . ,y),Ci(y, . . . ,y)}) £ ISO(C), then by Lemma [l9] 
#i(S) = #i({Co(y, ■ ■■,?/), Ci(y, . . . ,y)}), which is clearly a contradiction since 
# 1 ({C (y,...,y),C 1 (y,...,y)}) = 0. 

Next, we claim, analogously to Claim |l5|, that 

(1) if C is 1-valid, not Schaefer, and not 0-valid, then ISO(C) is coNP-hard; and 

(2) If C is 0-valid, not Schaefer, and not 1-valid, then ISO(C) is coNP-hard. 

For the first case, we use the same reduction as in the proof of Claim [l3| Note 
that if constraint set S is equivalent to {C(xi, . . . , xi) | 1 < i < n}, then 

{C(xi, . . . , Xj) | 1 < i < n}) £ ISO(C) via tt = id. For the other direction 
note that S g CSP^i(C) iff #i(5) > 2, but #i({C(xi, . . . , Xi) \ 1 < i < n}) = 1. 
By Lemma |l9| the result follows. The proof of the second case is similar. 

The remaining case is that of a set C not Schaefer, but both 0-valid and 1- 
valid. We use the same reduction as in Claim [l5| Clearly if (5, U) £ EQUIV(C) 
then also (S,U) £ ISO(C) via tt = id. To show the other direction note that 
if S i CSP #0 ,i(C) then #!(5) > 3, but #i(A™ =1 x. ( V A^i^) = 2 - Now use 
Lemma |l^ to show that S is not isomorphic to one of the formulas constructed 
in the cases examined in Claim [15]. This completes the proof of the theorem. □ 

To complete the proof of Theorem [l8| it remains to show that if C is Schaefer, 
then ISO(C) and ISO c (C) are polynomial-time many-one reducible to GI. We will 
reduce ISO c (C) to graph isomorphism for vertex-colored graphs, a GI variation 
that is polynomial-time many-one equivalent to GI. 

Definition 21. VCGI is the problem of, given two vertex- colored graphs Gi — 
(Vi, Ei,Xi), i £ {1, 2}, Xi '■ V ~^ N, to determine whether there exists an isomor- 
phism between G\ and G2 that preserves colors, i.e., whether there exists a bijec- 
tionir:Vi — > V<i such that {v , w} £ E\ iff {ir(v),ir(w)} £ E<x and x( v ) = x( 7r (' 1 '))- 



Proposition 22 ([Fon76, BC79[ ). VCGI is polynomial-time many-one equiv- 
alent to GI. 



By Proposition |2^, to complete the proof of Theorem [18], it suffices to show 
the following. 

Theorem 23. Let C be a set of constraints. If C is Schaefer, then ISO c (C) <? m 
VCGI. 



Proof. Suppose C is Schaefer, and let S and U be sets of constraint applications 
of C with constants over variables X. We will first bring S and U into normal 
form. 



Let S be the set of all constraint applications A of C with constants such that 
all of j4's variables occur in X and such that S — > A. Similarly, let U be the set 
of all constraint applications B of C with constants such that all of £Ts variables 
occur in X and such that U — ► B. It is clear that S = S, since S C 5 and 
5 — * 5. Likewise, U = U. Note that 5 and i7 are polynomial-time computable 
(in 1(5, f7) |), since 

1. there exist at most ||C||(||X|| + 2) k constraint applications A of C with con- 
stants such that all of A's variables occur in X, where k is the maximum 
arity of constraints in C; and 

2. since C is Schaefer, determining whether S — > A or E/ — » A takes polynomial 
time, using the same argument as in the proof of Theorem [?]. 

Note that we have indeed brought S and U into normal form, since if S = U 
then S = U, so for any permutation tt of X, if 7r(5) = U, then 7r(S l ) = C7. We 
remark that this approach of first bringing the sets of constraint applications into 
normal form is also followed in the coIP[l] NP upper bound proof for isomorphism 
between Boolean formulas 

It remains to show that we can in polynomial time encode S and U as vertex- 
colored graphs G(S) and G(U) such that there exists a permutation tt of X with 
ir{S) =Uif and only if (G(S),G(U)) £ VCGI. 

Let C = {Cx,...,C m }, let P = {C^ (in, sc 12 , . . . , x lkl ), 
C i2 (x 2 i, X22, ■•■ : x 2fc 2 ),•■■, C ie (x ei , xe 2 , ■■■ , X£k e)} be a set of constraint appli- 
cations of C with constants over variables X such that i\ < ii < i% < ■ ■ ■ < ig. 
Define G(P) = (V, E, x) as the following vertex-colored graph: 

- V = {0, 1} U { x | x G X } U { a,ij | 1 < i < t, 1 < j < h } U { A t | 1 < i < 
£}. That is, the set of vertices corresponds to the Boolean constants, the 
variables in X, the arguments of the constraint applications in P, and the 
constraint applications in P. 

- E = { {x, atj} | x = Xij } U { {dij, Ai) | 1 < i < £, 1 < j < h }. 

- The vertex coloring x w iH distinguish the different categories. Of course, we 
want to allow any permutation of the variables, so we will give all elements 
of X the same color. In addition, we also need to allow a permutation of 
constraint applications of the same constraint. 

• x(o) = o, x(l) = l, 

• xi x ) — 2 for all x G X, 

• x( A r) = 2 + j if i r = j, and 

• x( a ij) = 2 + m + j. (This will ensure that we do not permute the order 
of the arguments.) 

If there is a permutation ir of X such that ir(S) = U, it is straightforward to see 
that (G(S),G(U)) G VCGI. On the other hand, if (G(S),G(U)) G VCGI via a 
permutation tt of the vertices of G(S), then note that vertices corresponding to 
constraint applications can only be permuted together with those vertices corre- 
sponding to the arguments of that constraint application. In addition, because 
of the coloring, the order of arguments is preserved. Thus, if ir(Ai) — Aj then 



ATOP | . 



necessarily 7r(aj r ) = a,j r , for all 1 < r < ki and Ai and, because coloring is 
preserved, Ai and Aj are instances of the same constraint. This part of the per- 
mutation corresponds to a permutation of the constraint applications in the set 
S. The remaining part of the permutation in G(S) is one that solely permutes 
vertices corresponding to variables in S, so ir(S) — U. □ 
Note that the construction used in proof of the previous theorem can be used 
to provide a general upper bound on ISO c (C): Given sets S and U of constraint 
applications of C with constants, first bring S and U into the normal form (S 
and U) described in the proof of the previous theorem (this can be done in 
polynomial time with parallel access to an NP oracle), and then determine if 
there exists a permutation tt such that ir(S) = U (this takes one query to an NP 
oracle) . The whole algorithm takes polynomial time with two rounds of parallel 



queries to NP, which is equal to PfP (Buss and Hay fBH91|). Thus, we have the 



following upper bound on the isomorphism problem for contraint satisfaction: 

Theorem 24. Let C be a finite set of contraints. ISO(C) and ISO c (C) are in 
Pj^ p 

Finally, we show that for some simple instances of Horn, bijunctive, and 
affinc constraints, the isomorphism problem is in fact polynomial-time many- 
one equivalent to the graph isomorphism problem. Proofs of these results will 
be given in the appendix. 

Theorem 25. GI is polynomial-time many-one equivalent to ISO({{(0, 1), 
(1, 0), (1, 1)}}) and to ISO C ({{(0, 1), (1, 0), (1, 1)}}). 

Proof. It suffices to show that GI <£, ISO({{(0, 1), (1, 0), (1, 1)}}), since, by 
Theorem ||iSO c ({{(0,1), (1,0), (1,1)}}) <f„ GI. 

Let G = (V, E) be a graph and let V — {1,2,..., n}. We encode G in the 
obvious way as a set of constraint applications: S(G) — {xi VXj \ € E}. It 

is immediate that if G and H are two graphs with vertex set {1, 2, . . . , n}, then 
G is isomorphic to H if and only if S(G) is isomorphic to S(H). □ 

Note that the c onstrain t {(0, 1), (1,0), (1, 1)} is the binary constraint x V y, 



denoted by ORq in [CKS00J. Theorem can alternatively be formulated as: GI 



is polynomial-time many-one equivalent the isomorphism problem for positive 
2CNF formulas (with or without constants). Also, from | TorOC| , we conclude 



that this isomorphism problem thus is hard for NL, PL, and DET. 

Theorem 26. GI is polynomial-time many-one equivalent to IS0({{(1, 0, 0), 
(0, 1,0), (0, 0, 1), (1, 1, 1)}}) and to IS0 C ({{(1, 0, 0), (0, 1,0), (0, 0, 1), (1, 1, 1)}}). 

Proof. It suffices to show that GI <P m IS0({{(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)}}), 
since, by Theorem || IS0 C ({{(1, 0, 0), (0, 1,0), (0, 0, 1), (1, 1, 1)}}) <^ GI. 

Let G = (V, E) be a graph, let V — {1,2, ... , n}, and enumerate the edges 
as E = {ei, e%, . . . , e m }. We encode G as a set of XOR3 constraint applications 
in which propositional variable X4 will correspond to vertex i and prepositional 
variable ?/, will correspond to edge e,. We encode G as S(G) — Si(G) U 82(G) U 
S 3 (G) where 



— Si(G) = {xi © xj © yk | e k = {i,j}} (5i(G) encodes the graph), 

— 52(G) = {xi ffiZi ® z[ I i £ V} (52(G) will be used to distinguish x variables 
from y variables), and 

— 53(G) = {yi © yj © yk I ej, ej, and form a triangle in G}. Note that for 
every A £ Ss(G), 5i(G) — > A. We add these constraint applications to 5(G) 
to ensure that 5(G) is a maximum set of XOR3 formulas. 

We will show later that if G and H are two graphs with vertex set {1, 2, . . . , n} 
without isolated vertices, then G is isomorphic to H if and only if 5(G) is 
isomorphic to S{H). 

The proof of the theorem relies on the following lemma, which shows that 
5(G) is a maximum set of XOR3 formulas. This is an important property, 
since checking whether two maximum sets of functions are equivalent basically 
amounts to checking whether the sets are equal, as will be explained in detail 
after the proof of the lemma. 

Lemma 27. Let G = (V, E) be a graph such that V = {1,2, ... ,n} and E = 
{ei, ei, ■ ■ ■ , e m }. Then for every triple of distinct propositional variables a, b, c in 
5(G), the following holds: If 5(G) a © b © c, then a © b © c £ 5(G) . Note: we 
view a © b © c as a function, and thus, for example, a (B b (B c — c © a © 6. 

Proof. Suppose that there exists a triple of distinct propositional variables a, 
b, and c in 5(G) such that 5(G) a © b © c and a © b © c £ 5(G). Let 
X = {xi I i £ V}, Y = {yi \ a £ E}, and Z = {z l7 z[\i£ V}. Without loss of 
generality, assume that a < b < c, where < is the following order on X UY Li Z: 

x\ < ■ ■ ■ < x n < y x < ■ ■ ■ < y m < zi < ■ ■ ■ < z n < z[ < ■ ■ ■ < z' n . 

The proof consists of a careful analysis of different sub-cases. We will show 
that in each case, there exists an assignment on XLIYLIZ such that that satisfies 
5(G) but not (a©6©c), which contradicts the assumption that 5(G) — > affi&ffic. 

It is important to note that any assignment to X can be extended to a 
satisfying assignment of 5(G). 

1. If exactly three or exactly one of the variables in {a,b,c} are in Z, then 
consider the assignment that assigns to every variable in Z and 1 to every 
variable inXuy. Clearly, this assignment satisfies 5(G) but not (affi&ffic). 

2. If exactly two of the variables in {a, b, c} are in Z, then, since a < b < c, b 
and c are in Z. We consider the the following two sub-cases, depending on 
whether a £ X or a £ Y. 

(a) If a £ X, then set b and c to 1 and a to 0. Since by assumption a © 
b © c ^ 5(G), it is easy to see that this assignment can be extended to 
an assignment on X U Z that satisfies x» © © z\ for all i £ V. This 
assignment in turn can be extended to an assignment on X U Y U Z 
that also satisfies every constraint application of the form x, © Xj © yk 
for ek — {i,j}- So, we now have an assignment that satisfies 5i(G) and 
52(G). Since S\(G) —> A for every constrain application A £ 53(G), it 
follows that this assignment also satisfies 5(G) while it does not satisfy 
(affifrffic). 



(b) If a E Y, let a = yk where = If {6, c} = {ze,z' e } for some £, 

then we set to 1. In all cases, set exactly one of {xi,Xj} to 1 (this 
could be xg). Set all other elements of X to 0. We can extend this to a 
satisfying assignment of 5(G) that does not satisfy (a © b © c). 

3. If a, &, and c are in X, then set a, b, and c to 0. It is easy to see that 
this assignment can be extended to an assignment on X U Z that satisfies 
Xi © Zi © Zj' for all i £ V. This assignment in turn can be extended to an 
assignment on X UY U Z that also satisfies Xi®Xj®yk for = {i, j}. So, we 
now have an assignment that satisfies 5(G) but does not satisfy (a © b © c). 

4. If c E y and a and 6 are in X, suppose that c = yk and let = By 
the assumption that a © b © c is not in 5(G), at least one of a and 6 is not 
in {x^ Xj}. 

Without loss of generality, let a g" {xi,Xj}. Set a to and set X \ {a} to 1. 
This assignment can be extended to a satisfying assignment for S(G). Note 
that such an assignment will set y^ to 1. It follows that this assignment does 
not satisfy a © b © c. 

5. If a E X and 6 and c are in Y, then set a to and b and c to 1. It is easy to 
see that this can be extended to a satisfying assignment for S(G). 

6. If a, b, and c are in Y, let a = 2/^,6 = y/c 2 i c = 2/fc 3 such that = {it,jt} 
for £ E {1,2,3}. First suppose that for every £ E {1,2,3}, for every 
x E {x ii ,Xj l! }, there exists an £' E {1,2,3} with £' ^ £ and a constraint 
application A in 5(G) such that x and j/fe , occur in A. This implies that 
every vertex in ji, £2, ji, 13, J3} is incident with at least 2 of the edges 
in efej , e/c 2 , efc 3 . Since these three edges are distinct, it follows that the edges 
efcj , e/c 2 , efe 3 form a triangle in G, and thus y^ © j/fe 2 © yu 3 € 5(G). This is a 
contradiction. 

So, let £ E {1,2, 3}, a; E {x i{ , ^ } be such that for all £' E {1,2,3} with £ ± £', 
a; and , do not occur in the same constraint application in 5(G). Set x to 
and set X \ {x} to 1. This can be extended to a satisfying assignment of 
5(G) and such a satisfying assignment must have the property that yk e is 
and y k( , is 1 for all I' E {1, 2, 3} such that £' ^ I. □ 

How can Lemma |27]help us in the proof of Theorem ^? Note that if 5 and 
T are maximum sets of C constraint applications, then S = T if and only if 
S = T. Here equality should be seen as equality between sets of functions, i.e., 
a ® b ® c — 6fficffia etc. So 5 is isomorphic to T if and only if there exists a 
permutation p of the variables of 5 such that p(S) = T. 

We will now prove Theorem Let G and H be two graphs. Remove the 
isolated vertices from G and H. If G and H thus modified do not have the same 
number of vertices or they do not have the same number of edges, then G and H 
are clearly not isomorphic. If G and H have the same number of vertices and the 
same number of edges, then rename the vertices in such a way that the vertex 
set of both graphs is V = {1, 2, ... , n}. Let {ei, . . . , e m } be an enumeration of 
the edges in G and let {e[, . . . , e' m } be an enumeration of the edges in H. 

We will show that G is isomorphic to H if and only if 5(G) is isomorphic to 
S(H). 



The left-to-right direction is trivial, since an isomorphism between the graphs 
induces an isomorphism between sets of constraint applications as follows. If 
7r : V — > V is an isomorphism from G to H, then p is an isomorphism from 5(G) 
to S(H) defined as follows: 

- p(xi) = x„(q, p(zi) = Zn(i), pi'4) = z'^y for i E V. 

- For e k = {i,j}, p(y k ) = y t where e' e = {%({), 

For the converse, suppose that p is an isomorphism from 5(G) to S(H). 
By the observation above, p(S(G)) = S(H). Now look at the properties of the 
different classes of variables. 

1. Elements from X are exactly those variables that occur at least twice and 
that also occur in an element of 5 (G) together with two variables that occur 
exactly once. So, p will map X onto X. 

2. Elements of Z are those variables that occur exactly once and that occur 
together with an element from X and another element that occurs exactly 
one. So p will map Z to Z. 

3. Everything else is an element of Y. So, p will map Y onto Y. 

For i E V, define 7t(i) = j iff p(xi) = Xj. tt is 1-1 onto by observation (1) above. 
It remains to show that {i,j} S E iff {7r(i), 7r(j)} G E' . Let e k = {hj}- Then 
Xi®Xj (Be k G 5(G). Thus, /o(x,) ® p{x-j) &) p(y k ) G That is, ©a^y) © 

p(yfc) G S(H). But that implies that p{y k ) = yi where e' g = {7r(j), 7r(j)}. This 
implies that {7r(i), 7r(j)} G E 1 . For the converse, suppose that {7r(i), 7r(j)} G E' . 
Then x^iy © x^u\ © ye G S*(iJ) for = {7r(i), 7r(j')}. It follows that Xi © © 
p~ x {yi) G 5(G). By the form of 5(G), it follows that {i,j} EE. □ 
Note that the constraint {(1, 0,0), (0 1, 0), (0, 0, 1), (1, 1, 1)}) is the constraint 



x © y © z, denoted by XOR 3 in [CKS00| 



Note that the previous proof shows ISO(NXOR 3 ) and ISO c (NXOR 3 ) are 
many-one equivalent to GI (just negate all variables). And we can replace the 3 
by any k > 3 (just use duplicate copies of variables). 

From Theorems |25| and ^6|, we conclude that, if we could show that isomor- 
phism for bijunctive, anti-Horn (and, by symmetry, Horn) or affine CSPs is in 
P, then the graph isomorphism problem is in P, settling a long standing open 
question. 
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